Calibration of Polytomous Item Families Using Bayesian Hierarchical Modeling

Authors

  • Matthew S. Johnson
  • Sandip Sinharay
  • Andreas Oranje
  • Isaac Bejar
  • Randy Bennett
  • Paul Holland
  • Alina von Davier
  • Hariharan Swaminathan
  • Shelby Haberman
Abstract

For complex educational assessments, there is an increasing use of item families, which are groups of related items. However, calibration or scoring for such an assessment requires fitting models that take into account the dependence structure inherent among the items that belong to the same item family. Glas and van der Linden (2001) suggest a Bayesian hierarchical model to analyze data involving item families with multiple-choice items. This paper extends the model to take into account item families with polytomous items and designs a Markov chain Monte Carlo (MCMC) algorithm for the Bayesian estimation of the model parameters. The hierarchical model, which accounts for the dependence structure inherent among the items, implicitly defines the family response function for the score categories. This paper suggests a way to combine the family response functions over the score categories to obtain a family score function, which is a quick graphical summary of the expected score of an individual with a certain ability on an item randomly generated from an item family. This paper also suggests a method for the Bayesian estimation of the family response function and family score function. This work is a significant step towards building a tool to analyze data involving item families and may be very useful practically, for example, in automatic item generation systems that create tests involving item families.
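As an illustration of the family score function described above, the following is a minimal sketch, not the authors' implementation. It assumes a generalized partial credit form for each sibling item and a multivariate normal family-level distribution on (log slope, step parameters); the abstract specifies neither. The names `gpcm_probs`, `family_functions`, `mu`, and `cov` are hypothetical. The family response functions are approximated by averaging category probabilities over items drawn from the family, and the family score function is the score-weighted sum of those averages.

```python
import numpy as np


def gpcm_probs(theta, a, b):
    """Category probabilities for one item under an (assumed) generalized
    partial credit model.

    theta: abilities, shape (T,)
    a:     positive slope (scalar)
    b:     step parameters, shape (K,), giving score categories 0..K
    Returns an array of shape (T, K + 1) whose rows sum to 1.
    """
    theta = np.asarray(theta, dtype=float)
    b = np.asarray(b, dtype=float)
    steps = a * (theta[:, None] - b[None, :])            # shape (T, K)
    logits = np.concatenate(
        [np.zeros((theta.size, 1)), np.cumsum(steps, axis=1)], axis=1
    )
    logits -= logits.max(axis=1, keepdims=True)          # numerical stability
    p = np.exp(logits)
    return p / p.sum(axis=1, keepdims=True)


def family_functions(theta, mu, cov, n_draws=2000, seed=0):
    """Monte Carlo estimate of family response and family score functions.

    mu, cov: hypothetical family-level mean and covariance of the item
             parameter vector (log a, b_1, ..., b_K).
    Returns (frf, fsf): frf has shape (T, K + 1), fsf has shape (T,).
    """
    rng = np.random.default_rng(seed)
    mu = np.asarray(mu, dtype=float)
    draws = rng.multivariate_normal(mu, cov, size=n_draws)
    frf = np.zeros((len(theta), mu.size))                # K + 1 categories
    for d in draws:
        # Each draw is one sibling item generated from the family.
        frf += gpcm_probs(theta, np.exp(d[0]), d[1:])
    frf /= n_draws
    scores = np.arange(frf.shape[1])                     # 0, 1, ..., K
    fsf = frf @ scores                                   # expected score
    return frf, fsf


# Illustrative values only: a three-category family (scores 0, 1, 2).
theta = np.linspace(-3.0, 3.0, 7)
mu = np.array([0.0, -0.5, 0.5])
cov = 0.05 * np.eye(3)
frf, fsf = family_functions(theta, mu, cov)
```

In a fully Bayesian analysis, `mu` and `cov` would themselves be uncertain, so the averaging would also run over posterior draws of the family-level parameters (for example, from MCMC output) rather than over fixed values as in this sketch.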


Related articles

A comparison study of IRT calibration methods for mixed-format tests in vertical scaling

The purpose of this dissertation was to investigate how different Item Response Theory (IRT)-based calibration methods affect student achievement growth pattern recovery. Ninety-six vertical scales (4 × 2 × 2 × 2 × 3) were constructed using different combinations of IRT calibration methods (separate, pair-wise concurrent, semiconcurrent, and concurrent), lengths of common-item set (10 vs. 20 com...


Formulating latent growth using an explanatory item response model approach.

In this paper, we present a way to extend the Hierarchical Generalized Linear Model (HGLM; Kamata (2001), Raudenbush (1995)) to include the many forms of measurement models available under the formulation known as the Multidimensional Random Coefficients Multinomial Logit (MRCML) Model (Adams, Wilson and Wang, 1997), and apply that to growth modeling. First, we review two different traditions in modeling growth...


Assessing IRT Model-Data Fit for Mixed Format Tests

This study examined various model combinations and calibration procedures for mixed format tests under different item response theory (IRT) models and calibration methods. Using real data sets that consist of both dichotomous and polytomous items, nine possibly applicable IRT model mixtures and two calibration procedures were compared based on traditional and alternative goodness-of-fit statisti...


Personalized cardiorespiratory fitness and energy expenditure estimation using hierarchical Bayesian models

Accurate estimation of energy expenditure (EE) and cardiorespiratory fitness (CRF) is a key element in determining the causal relation between aspects of human behavior related to physical activity and health. In this paper we estimate CRF without requiring laboratory protocols and personalize energy expenditure (EE) estimation models that rely on heart rate data, using CRF. CRF influences the ...


Effect of item order on item calibration and item bank construction for computer adaptive tests

Item banks are typically constructed from responses to items that are presented in one fixed order; therefore, order effects between subsequent items may violate the independence assumption. We investigated the effect of item order on item bank construction, item calibration, and ability estimation. 15 polytomous items similar to items used in a pilot version of a computer adaptive test for anx...



Publication date: 2003